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Abstract. We give an elementary introduction to some recent polyhedral 
techniques for understanding and solving systems of multivariate polynomial 
equations. We provide numerous concrete examples and illustrations, and 
assume no background in algebraic geometry or convex geometry. Highlights 
include the following: 

(1) A completely self-contained proof of an extension of Bernstein's Theo- 
rem. Our extension relates volumes of polytopes with the number of 
connected components of the complex zero set of a polynomial system, 
and allows any number of polynomials and/or variables. 

(2) A near optimal complexity bound for computing mixed area — a quan- 
tity intimately related to counting complex roots in the plane 



1. Introduction 

In a perfect world, a scientist or engineer who wishes to solve a system of 
polynomial equations arising from some important application would simply pick 
up a book on algebraic geometry, look through the table of contents, and find a 
well-explained, provably fast algorithm which solves his or her problem. (Algebraic 
geometry began 2000 years ago as the study of polynomial equations, didn't it?) 
He or she would then surf the web to download a good (free) implementation which 
would run quickly enough to be useful. 

Once one stops laughing at how the real world compares, one realizes what 
is missing: the standard classical algebraic geometry texts (e.g., |EGA1, EGA2, 
EGA3.lL |EGA3.2|, ^GA4.l|, [EGA4.2|, |EGA4.3|, |SGAl|, |SGA2|, gGA3.1, 



jGA4.l|, |SGA4.2|, |SGA4.3| , ^GA4:^| , |SGA5| , |SGA6| , |SGA7| , 
Har77 , Sha94 , GH94 1 ]) rarely contain algorithms and none 
contains a complexity analysis of any algorithm. Furthermore, one soon learns 
from experience that the specific structure underlying one's equations is rarely if 
ever exploited by a general purpose computational algebra package. 



SGA3.2, SGA3.3 



Mum95, 



Mum99, 



This research was partially supported by a grant from the Texas A&M College of Science 
and NSF Grant DMS-0211458. 

^In fairness, it should be noted that the major thrust of 20— century algebraic geometry 
was understanding the topological nature of zero sets of polynomials, rather than efficiently 
approximating the location of these zeros. 
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Considering the ubiquity of polynomial equations in applications such as geo- 
metr ic mode lling [[IV[an98| , |Gol03| control theory |Sus98|, | NM99[ cryptogra- 
phy | DodOl[, ra dar imaging | FH95|, l earning theory |Vid97, VR02|, chemistry 
|GH99| , [GatOlt , game theory | |McL97| , [Roj97t , and kinematics lCan93| , |EM99| 

(just to mention a few applications), it then becomes clear that we need an algo- 
rithmic theory of algebraic geometry that is practical as well as rigourous. One 
need only look at the active research in numerical linear algebra (e.g., eigenvalue 
problems for large sparse matrices) to see how far we are from a completely satisfac- 
tory theory for the numerical solution of general systems of multivariate polynomial 
equations. 

More recently, the introduction of algorithmic and combinatorial ideas has in- 
vigorated computational algebraic geometry. Here we give an elementary introduc- 
tion to one recent aspect of computational algebraic geometry: polyhedral methods 
for solving systems of multiv ariate p olynomial equat ions. The buzz-word for the 
cogniccnti is toric varieties |Ful93, Cox03| , Sot03 . However, rather than deriv- 
ing algorithms from toric variety theory as an afterthought, we will begin directly 
with concrete examples and see how convex geometry naturally arises from solving 
equations. 

Simply put, polyhedral methods are a first step toward a new class of algo- 
rithms which adapt themselves to the intrinsic nature of the underlying system of 
equations. For the purposes of this paper, this means that our polynomial equa- 
tions will be expressed as sums of monomial terms, and the techniques we describe 
will exploit the combinatorial structure of which monomial terms appear. 

Example 1.0.1. Suppose one has the following 3 equations in 3 unknowns x, 
y, and z: 

C2,i + C2,2X + a^i + a/ + Ci^iX^fz' + Ci^ix'^y'x^ + C2.^x''yh'^ + C2,sx^iz^ + 02,9x^2'' + cj^ioi'/z' + cmxYh^ + cj.u^'l/'z'" = 
C3,i + C3,2i + cui + C3,4Z^ + C3,5i^!/^z^ + c^fX^x^ + c^jx^^^ + ^3,81*^^ + c^/^z'^ + ci^xY^^ + c^^nY^z'^ + Cui^^y^z^" = 0, 

where the Ci^a o,re any given complex numbers. One may reasonably guess that 
such a system of equations, being neither over-determined or under-determined, will 
have only finitely many roots {x,y,z) G C'^ with probability \, for any continuous 
probability distribution on the coefficient space C^^ . In fact, wit h prob abili ty 1, t he 
number of roots will always be the same number (cf. Theorems 3.2.1 and 3.2.i of 
Section S.i). What then is this "generic" number of roots? 

Noting that the maximum of the sum of the exponents in any summand of the 
first, second, or third equation is 2 8 (i.e., our polynomials each have total degree 
28), a classical theorem of Bezout [|Sha94 , Ex. 1, Pg. 198] gives us an upper bound 
0/21952 = 28^. A slightly more refined variant which uses degrees with respect to 
different variables, the multi-graded version of Bezout 's Theorem [MS87|, yields 
a sharper upper bound 0/ 6000 = 6 • 10"^. 

However, the true generic number of roots is 321. This number was calculated 
by using the correct concept in our setting: the convex hull^ of the exponent vectors 
(also known as the Newton polytopesj of our polynomials. In this case, all our 



^ Recall that a set _BCR" is convex iff for all x,y&B, the line segment connecting x and 
y is also contained in B. The convex hull of B, Conv(_B), is then simply the smallest convex 
set containing B. and t he computational complexity of convex hulls of finite point sets is fairly 
well-understood [PS851. 
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Newton polytopes are identical, and the volume (suitably normalized) of any one 
serves as the correct generic number of complex roots. 




Figure 1. Several views of the Newton polytope shared by our three equations 
The key idea to keep in mind is that the complexity of solving a system of 
polynomial equations, or even approximating a single root, depends strongly on the 
total number of complex roots. Since one does not usually know this number a 
priori, the algorithm one uses ultimately makes implicit use of some upper bound 
on this number, usually one of the three we just mentioned. So, much as how our 
preceding comparison of bounds turned out, algorithms which take monomial term 
structure into account are preferable over those that do not. o 

A natural question, especially relevant to geometric modelling, then arises: Is 
there an analogous theory for systems of equations expressed in other bases? In 
particular, the systems of equations arising from i?-splines are expressed in the 
so-called Bernstein-Bezier basis which uses sums of terms like Yiii^ ~ ^i^^x^K 
The short answer is that an analogous theory for s uch bas e s docs no t yet ex - 
ist. However, the philo sophies of fcwnom ial theory Kho91 , LRW03 , RojOSj, 



straight - line program s J Roj02 



JKSS03f|, no t to mention poly hedral methods 



Ro.i94 |HS95| , [Li97| , |Ro.i99b| , |VerOO| , |EP02| , |McD02| , |MR03|1 , are bringing us 
closer to a theory that can efficiently handle such questions. 

We now outline the main results wc explain in our paper. The second result 
below is new, while the first is older. (Related earlier results will be reviewed 
throughout this paper.) However, we emphasize that the statement and proof of the 
first result has been considerably simplified, we provide many more illustra t ions and 
example s than what is u sually found in the earlier literature (e.g., [ Ber75 , Kus75 , 
Kus76, Pul93, HS95 ), and we have made an effort to keep all prerequisites 



explicit and contained in this paper. 



Notation 1.0.1. 
1" and let ei,...,t 



"I" 




"0" 
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Let denote the origin in 
,1 denote the standard basis vectors 

Also, for anyBCT, let Conv(B) 



denote the smallest convex set containing B. Also, 
we let Vol(-) denote the usual n- dimensional volume 
inV, renormalized so ttfliVol(Conv({0,ei,...,e„}) = l. 
Finally, we will abuse notation slightly by setting 
Vo1(j4) := Vol(Conv(j4)) whenever A is a finite 
subset o/M". 



Notation 1.0.2. For any c G C* := C\{0} and 
a = (fli, . . . , a„) e Z", let i" := ij' ■ ■ ■ xJJ" and call 
ci" a monomial term. Also, for any polynomial 
of the form f[x) ^laeA ^ai", we call Supp(/) := 
{a I Ca^O} support off, and define 'Nmt{f):= 
Conv(Supp(/)) to be the Newton polytope of f. 
We will assume henceforth that F :~ (/i,...,/^) 
where, for alii, /,gC[.ti,...,3;„] andSu'p]){ft)QAi. 
We call F akxn polynomial system (over C) 
with support {Ai,...,Ak). Finally, we let Zir{F) 
denote the set o/sGC" with fi{x)- ■■■ ~fk{i)~0. o 



Theorem 1. (Special Case (full version in Sec. ^)) Following the notation 
above, the number of connected components of Zc{F) is no more than Yo\{B), 
where B .= {0, ei, . . . , e„} U 1J*L]^ Ai. In particular, if the number of complex roots 
of F is finite, then it is no more than Vol(B). 
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As might be expected, a sharper estimate on the generic number of complex 
roots comes at a price: the resulting formula is more difficult to evaluate. However, 
one can get an explicit and optimal complexity estimate for the case of a pair of 
bivariate equations. 

Theorem 2. Following the notation of Theorem^, suppose k^n~2. Then the 
generic number of complex roots of a polynomial system F = (/i, with support 
{Ai,A2) can be computed within 0{bN + NlogN) bit operations, and takes at least 
n[bN) bit operations in the worst case, where N denotes the sum of the cardinalities 
of the Ai, and b is the maximum bit-length of any coordinate of any Ai. 

Numerous examples of our two main theorems will appear as we review some of 
the background necessary for the applications and proofs of our theorems. Theorem 
is proved three times: the simplified version above is proved in Section ^ and 
then two proofs of the full version appear respectively in Sections ^ and ^ The last 
proof uses the main combinatorial construction detailed in this paper: the mixed 
subdivision of n polytopes in n dimensions. Theorem 2 is proved in Section ^ as 
a simple consequence of mixed subdivisions in the plane. 

2. Prom Binomial Systems to Volumes of Pyramids 

Perhaps the best and simplest place to begin to understand the connection 
between polytopes and polynomials is the special case of binomial systems, i.e., 
polynomial systems where each polynomial has exactly 2 monomial terms. For such 
systems, there is an immediate connection to linear algebra over the integers. 



Example 2.0.2. Suppose we want to find all the complex solutions of 

Cl 



7 7 4 

xy z w 



x^y'^z^w^ 
x^y'^z^w^ 



C2 
C3 
C4 



where the Ci_j are given nonzero complex numbers. Note in particular that this im- 
plies that any root of our system must satisfy xyzw^O. A particularly elegant trick 

Tl 7 7 4r 

we 11 generalize shortly is the following: Consider the Ax A matrix E 



6 4 9 6 
2 3 2 6 
6 4 8 5 



whose i— row vector is the exponent vector of the i— equation above. Then multi- 
plying and dividing the equations above is easily seen to be equivalent to performing 
row operations on E. For example, doing a pivot operation to zero out all but the 
top entry of the first column of E is just the computation of the matrix factorization 

" 1 O O 0~| [1 7 7 4"| [1 7 7 4 ^ 

— 6 1 6 4 9 6 —38 —33 —18 

— 2 010 2 3 26~0 —11 —12 —2 

— 6 ij [6 4 8 sj [o —38 —34 —19 

to observing that 



, which is in turn equivalent 



Equation 1 is... 
(Equation 2)/ (Equation if is... 
(Equation 3)/ (Equation if is... 
(Equation 4) /(Equation ij 



X 



Cl 



6 



IS... 



-33 
-12 
-34 



W 



-18 



w 



-2 



y z w 

y^^^ z 

y-38 2:^"" 

Note also that this new binomial system has exactly the same roots as our original 
system. (This follows easily from the fact that our left-most matrix above is invert- 
ible, and the entries of the inverse are all integers. j 5*0 we can solve the last 3 



-19 



Cl ^C3 



C4 
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equations for {y, z, w) and then substitute into the first equation to solve for x and 
be done, o 

Note, however, that if we wish to complete the solution of our example above, we 
must continue to use row operations on E that are invertible over the integers]^ 
This can be done by performing a simple variant of Gauss- Jordan]^ elimination where 
one uses no divisions. In essence, one uses elementary integer row operations to 
minimize the absolute value of the entries in a given column, instead of reducing 
them to zero. 

Example 2.0.3. Let us consider the lower right 3x3 block of our last matrix 
from our last example. The variant of Gauss-Jordan elimination we propose is 
easily depicted below. 

c 



"-38 


-33 


-18 




Y'T 


■-11 


-12 


-18 


_ c 


■-11 


-12 


-18" 




■-1 -18 


22 ■ 


-11 


-12 


-2 






-38 


-33 


-18 


-5 


3 


-12 


-^tlt...^ 


93 


-122 


-38 


-34 


-19 






-38 


-34 


-19 




-5 


2 


-13 




-1 


-1 



Proceding with the same strategy on the lower 2x2 block of the last matrix, and 
then con tinuin g recursively, one finally obtains that our system of equations from 
Example 2.0.i is equivalent to the following simpler system: 



y 



62 
175 



215 



-10 



„23 



^82 



„11 

''3 

^31 



^38 



-26 
-75 



-93 



Note in particular that the exponent vectors above correspond exactly to the third 
and first matrices in the identity below: 



—3 23 11 

—8 66 31 

10 

-10 82 38 



-26 
-75 
-1 
-93 



17 7 

6 4 9 

2 3 2 

6 4 8 



1 O O 62 

10 175 

1 1 

215 



and that the left-most matrix U has determinant ~1 ( and thus the entries of U ^ 
are all integral). One should of course note that our system is now very easy to 



solve: since the roots of w" 



c are 



{I- 



n/d 



^1 l/dg27ri/d 



\l/d 2iri{d-l)/d 



can simply substitute the 215 resulting values for w into the first, second, and third 
equations to solve for x, y, and z, thus finding all 215 complex roots of our .system, o 

This motivates the following definition from 19— century algebra. 

Definition 2.0.1. Let Z"^" denote the set of allnxn matrices with all entries 
integral, and let GL„(Z) denote the set of all matrices in Z"^" with determinant 
±1 (the set o/ unimodular matrices). Recall that any mx n matrix [utj] with 
Uij =0 for all i> j is called upper triangular. Then, given any M £ Z"^", we 
call any identity of the form UM ~ H , with H = [hij] S Z"^" upper triangular and 
C/eGL„(Z), a Hermite factorization of M . Also if, in addition, we have: 

(1) hij>0 for all i, j . 

(2) for all i, if j is the smallest j' such that hij' ^0 then hij >hiij for all i' <i. 
then we call H the Hermite normal form of M. o 



While one could simply use rational operations on E, and thus radicals on our equations, 
this quickly introduces some unpleasant ambiguities regarding choices of d— roots. Hence the 
need for our integrality restriction. 

*Recall that Ga ussian elimination is what one docs to reduce a matrix to upper triangular 
form (see Definition 2.0. ij below ), while Gauss- Jordan elimination is what one does to reduce a 
matrix to diagonal form [ptr98|. 
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Lemma 2.0.1. [[Ii89] For any A/eZ"^", a Hermite factorization can be com- 
puted within 0{{n + hM)^'^"^^) bit operations, w/iere /ia/ :=log(2n + max |my |) and 

M = [TOy]. Furthermore, the Hermite normal form exists uniquely for M, and 
can also be computed within the preceding bit complexity bound. ■ 

By extending the tricks from our last examples, we can easily obtain the fol- 
lowing lemma. 

Lemma 2.0.2. S'wppose oi, . . . , a„ g Z" anrf ci, . . . , c„ e C* := C \ {0}. Let E 
denote the n x n matrix whose i— row is the vector a.i. Then the complex roots of 
the binomial system F: = [x""^ — Ci, . . . , x"" — c„) are exactly the complex solutions 
of the binomial system 



where [uij\E = [hij\ is any Hermite factorization of E. In particular, the complex 
roots of F can be expressed explicitly as monomials in i/ci, . . . , i/c^, where h := 

Letting (C*)" := (C \ {0})", we then easily obtain the following corollary. 



Definition 2.0.2. Given any kxn polynomial system F, its Jacobian matrix 



is Jac(F) := 



We then say that a root C e C" of F is degenerate 



g/t . . . g/t 
i/f rank Jac(i^)|a;=^<fc, anrf smooth otherwise, o 



Corollary 2.0.1. Suppose F — (/i, . . . , /„) is any nx n binomial system and, 
for all i, Vi is either vector defined by the difference of the exponent vectors of fi. 
Then F has only finitely many roots in (C*)" =^ F has exactly |detAf| many, 
where M is the nx n matrix whose i— row is v^. In particular, the last quantity is 
exactly 

Vol({0, «!,...,«„}). 

Also, every root of F in (C*) is non- degenerate iff F has exactly Vol({0, wi, . . . , 
roots in (C*)". Finally, fixing the support of F , there is an algebraic hypersurface A 
in the coefficient space C^" such that for all coefficient specializations specializations 
outside of A, F has exactly Vol({0, vi, . . . , «„}) roots in (C*)". ■ 

We illustrate the last portion of our corollary with the following example. 

Example 2.0.4. Let us find all (ci.i,ci^2,C2.i,C2,2,C3,i,C3.2)€C® such that 

C2,ia^'*y"^^° = C2,2 
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has infinitely many solutions in (C*)"^. In particular, by Lemma 2.0.i, the roots of 
our system are exactly those of 

2 7 5 Ci,2 

X y z = 

Cl,l 

4 

18 6 _ / Cl.2 \ / C3,2 



Cl,l/ \C3,1 
2 

Cl,2 \ C2,2 



y Z 

1 = 

,Cl,iy C2,l 

Clearly then, our system has infinitely roots in (C*)"^ *if 2^2,1 = C2,2Ci i (and no 
roots whatsoever if cf 2^2,1 y^C2,2Ci i). So in this example, we can take 

A={(ci,i,Ci,2,C2,l,C2,2,C3,l,C3,2)eC'' | Ci_2Cl,2 = C2,2C?,i } . O 

We conclude this section with a similar result for a slightly more complicated 
class of polynomial systems. 

Definition 2.0.3. Let F be any kxn polynomial system with support (Ai, . . . , Ak). 
Then we say that F is of type (mi, . . . ,Tnfc) iff =ffAi = mi for all i. Also, we that F 
is unmixed iff Ai = ■ ■ ■ = A^^. Finally, writing fi{x) ~ X^ogA ^i.a f'^'"' '^^^ ™^ ■'''^V 
a property V regarding F holds generically iff there is an algebraic hypersurface 
ncC^'*'^^ such that {c,^a I ie{l,...,n}, aeA,)e&'^*^^\l-l =^V holds, o 

Proposition 2.0.1. Following the notation above, ifVi, . . . ^Ve are properties 
of F that hold generically, then their conjunction Vi A ■ ■ ■ AVe holds generically as 
well. ■ 

Corollary 2.0.2. Given any unmixed nxn polynomial system F= (/i, . . . , /„) 
of type (n + 1, . . . , n + 1), let A be the support of any fi. Then F either has 
exactly Vol(^) roots in (C*)", no roots in (C*)", or infinitely many roots in (C*)". 
Furthermore, the first possibility holds generically and implies that all the roots of F 
in (C*)" are non- degenerate. Finally, however many roots F has in (C*)", they can 
always be expressed explicitly as monomials in \o\{A)— -roots of linear combinations 
of the coefficients of F (and possibly some additional free parameters). 

Proof: By Gauss- Jordan elimination, F is equivalent to a binomial system (i.e., 
one considers the monomials of F as new variables, thus obtaining a linear system 



that we can place into reduced row echelon form). So by Corollary 2.0.1, and some 
additional care with the Hermitc normal when F has infinitely many roots, we are 
done. ■ 



Corollary 2.0.2 will be the cornerstone of our proof of the special case of Theo- 
rem where k = n and F is unmixed (also known as Kushnirenko's Theorem). 
Not e in pa rticular that any Newton polytope from a polynomial system as in Corol- 
lary ^.0.2| , when Vol(yl) >0, is an n-simplex in R", i.e., the n-dimensional analogue 



of a 3-dimensional pyramid with a triangular base. 



3. Subdividing Polyhedra and Kushnirenko's Theorem 



Here we prove the following central result which gives a strong connection 
between polytope volumes and the number of complex roots of polynomial systems. 
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Theorem 3.0.1 (Kushnircnko's Theorem). Following the notation of Defini- 
tion 2. O.J of Section @, suppose F = (/i, . . . , /„) is n x n polynomial system with 
Supp(/i) C A for all i. Then F has only finitely many roots in (C*)" =^ F has 
<Vol(^) roots in (C*)". Furthermore, for fixed A, F generically has exactly Yo\{A) 
roots in (C*)". 



This resuh is originaUy due to Anatoly Georievich Kushnirenko |Kus75, Kus76|. 
So while we certainly claim no originality in our proof, we have strived to sim- 
plify the known proofs from the literature and include as much of the necessary 
background as reasonably possible. 

Before laying the technical foundations for our proof, let us first see a concrete 
illustration of the main ideas. In essence, one proves Kushnircnko's Theorem by 
deforming F (preserving the number of roots along the way) into a collection 
of simpler systems. Making this rigourous and efficient then provides a natural 
motivation for a new space (containing an embedded copy of (C*)") in which our 
roots will live. 

Example 3.0.5. Consider the special case n~2 with 



fiix,y):=-2 + x^ -3y + 5x''y^ + Ax'^y^ 

f2{x, y):=3 + 2x^ +y + 4a; V + '^x^'' . 
The Newton polygon boundary and support appear below: 




Supp(/2) = A 



According to Theorem 3.0.1, F either has < 35 roots in (C*)^ or infinitely many. 
(The standard and multi-graded Bezout bounds respectively reduce to 169 = 13^ and 
98 = 2 • 7 • 1 .) The true number of roots for our example turns out to be exactly 
35, and these roots are all non-degenerate. 

To see why this is so, let us define atoric deformation F:=(fi, /2) as follows: 



fi{x,y,t):=- 
f2{x,y,t):= 

and A :— Supp(/i) = Supp(/2 



-2t 
3t + 



2x'- 



Ax''y^ 4 



f Ax^y'^t 
2xS't, 








to 









7 




6 












1 




5 




7 


1 



















1 



(We will 



later 



in Lemma 3.1.1 that most choices of powers of t for each monomial would have 
worked just as well for proving Kushnircnko's Theorem for our example.) Note 
that F{l,x,y) = F{x,y). Intuitively, one would expect 2 equations in 3 unknowns 
to generically define a curve (cf. Theorem 3.2.1 and the Implicit Function Theorem 
from calculus), and this turns out to be the case for our example. More to the point, 
the number of roots of F in (C*)^ is constant for all iGC\E, where S is a finite 
set not containing 1.^ So to show that F has 35 roots in (C*)^, it suffices to show 



This crucial fact is elaborated a bit later in this section — specifically, Lemma 3.2.1 
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that the number of roots of F in (C*)^ is 35 for some suitable fixed t. At least 
initially, this seems no easier than counting the roots of F. 

The key trick then is to count something else which, for fixed t sufficiently close 
to 0, is easily provable to be the same as the number of roots of F in (C*)^. This 
is where polyhedral subdivisions come into play almost magically. First, note that 
our new system is still unmixed but now has a i- dimensional Newton polytope: 





The Lower Hull of Conv(yl) The Upper Hull of Conv(yl) 

(View from (0, 0, t) with t < (View from (0, 0, t) with t > 

Next, note that any root {x,y,t) € (C*)^ of F lies on a parametric curve of the 
form C(^xo,yo,w) ■= {{s'^^Xq, s^^yo, s^^) \ sGC*}. However, we will see momentarily 
that the set ofwGl/^ for which the roots of Ft in (C*)^ approach a C(xo,yo,ii)) as 
t — > is dictated by the face structure of Com/{A), and all the roots of F in (C*)^ 

approach some C (^XQ ,yo .lo) ^ ^ 0- ^ 

Let us pause now to review some basic convex geometric definitions. 

Definition 3.0.4. A (closed) half-space (with (inner) normal a=(ai, . . . , a, 

Ha C K", is any set of the form {{yi, . . . , j/„) SM" | a^yi + ■ ■ ■ + a„j/„ > c} for some 
real number c. A polyhedron is any finite intersection of half-spaces. Also, a d- 
flat in M" is any translate of a d-dimensional subspace ofM.". Finally, a (convex) 
polytope in M" is the convex hull of any finite point set in R", and an n-simplex 
is the convex hull of any n + 1 points which do not lie in an (n — l)-flat. o 

Definition 3.0.5. For any w := {wi, . . . ,w„) G R" and any compact set B C 
R", we let — the face of B with (inner) normal w — be the set of all 
y := (yi, . . . , ?;„) e B minimizing the inner product w ■ y := Wiyi + ■ ■ ■ + Wnyn . We 
call a face Q of B lower (resp. upper) iff the last coordinate of any inner normal 
of Q is positive (resp. lower). Also, the dimension of a face Q of B, dimQ, 
is the dimension of the smallest fiat containing Q. Finally, for any d-dimensional 
polytope, its faces of dimension 0, 1, d — 2, d — 1, or d are respectively called 
vertices, edges, ridges, facets, or improper, o 

The most important thing we'll do with polytopes, after taking their faces, is 
to subdivide them. 

Definition 3.0.6. A subdivision of a polytope P is a collection of polytopes 
{Qi}iLi called cells satisfying the following conditions. 



(1) U Q^ = P 



(2) ForanyweR" and j e {1, . . . ,Af} , we have QJ e {Q.jfL^ 

In particular, if all the Qi are simplices then we say that {Qi\'iLi is a triangulation. 
Finally, we call a cell Qj of {Qi}^Li full-dimensional ijff dimQj =dimP. o 
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In particular, it is clear that one way to compute the volume of a polytope is to 
take any triangulation and add the volumes of all its full-dimensional cells. From 
basic linear algebra, wc know that this reduces to a finite sum of absolute values of 
determinants of matrices of edge vectors. 

Example 3.0.5 (Continued). Let us now examine the lower hull of A, projected 
onto the {x,y)-plane, and its inner lower facet normals. 




i'=(4. -7,18) 




In particular, the projections of the faces of the lower hull of A onto A induce a 
triangulation {Qi\ o/Conv(yl). 

Picking w = {l, 2,2) to examine the curves C(^x^ y^^^-^, we see that F(s^'^xo, s'^'^yo, s 
is exactly 

s^(— 2 + Xq — 3yo) + Higher Order Terms in s 

s^(3 + 2xq + t/o) + Higher Order Terms in s. 

In particular, the {xQ,yQ)£ (C*)^ which tend to a well-defined limit as s — > while 
satisfying F{s^xq, s'^yo, s'^) = must also satisfy 

-2 + - 3yo 
3 + 2x1 + yo- 

(This follows easily from the Implicit Function Theorem upon observing that the 
roots of (—2 + Xq — 3yo, 3 + 2xq + yo) are all non-degenerate.) So by Corollary 



2.0. e: of the last section, there are exactly Vol({(0, 0), (2, 0), (0, 1)}) — 2 such points. 
Put another way, the number of {xo,yo) € (C*)^ for which F has roots in (C*)^ 

approaching C(a;j, 2,2)) o,s t > is exactly 2. 

Let us call the last system an initial term system and observe that its New- 
ton polytope is exactly the cell of {Qi} corresponding to w = (1,2,2). Proceed- 
ing similarly with the other inner lower facet normals of A, there are exactly 
Vol({(0, 1), (7, 5), (6, 7)}) = 18 curves of the form C'(2,g ,^^ (4 _7 ig)), and exactly 
Vol({(2, 0), (0, 1), (7, 5)}) = 15 curves of the form C(a;Q^yQ_(o,o,i)), approached by roots 
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of F in (C*)'^ as t — > 0. Also, the last two initial term systems have Newton poly- 
tope respectively equal to the cell of {Qi\ with inner lower facet normal (4,-7,18) 
or (0,0,1). 

To conclude, note that w not a multiple of (1, 2, 2), (4, —7, 18), or (0, 0, 1) 
the resulting initial term system has a Newton polytope of dimension < 1. Since 
C(x„,y „,w) = C(^xo,yo,aw) for any aGZ and w(zT? , another application of Corollary 



2.0.^ then tells us that we have found all C(xa,va,w) (with {xQ,yQ)^{C*)'^ andw&Z,'^ ) 
that are approached by roots of F in (C*)'^ as t — > 0. Since there are 35 = Vol(A) 
such curves, and since they don't intersect at any fixed t, this implies that Ft has 
exactly 35 roots in (C*)^ for any t with \t\ sufficiently small. So, assuming every 
root of F in (C*)^ converges to some C(^xo,vo,w) '^^ ^ — * ^ exactly 35 roots 
and we are done, o 

The preceding argument can be made completely general (not to mention 
rigourous) with just a little more work. In particular, we can prove our last assump- 
tion by constructing a space in which the roots of F all converge to well-defined 
limits as t — > 0. This is one of the main motivations behind toric varieties, 
which provide a useful and elegant way to compactify (C*)". 

Remark 3.0.1. The idea of examining the behavior of the zero set of F along 
a monomial curve is really not so far-fetched. In many practical computations 



(see, e.g., [LW91, Li97]j, some roots of polynomial systems depending on param- 
eters tend to have unbounded coordinates, even if the parameters are all finite. So 
monomial curves are a natural approach to formalize what it means for a root to 
"approach infinity. " o 

3.1. Enter Toric Varieties Corresponding to Point Sets. 

Before going further, let us first give a more succinct definition of initial term 
systems and formalize our constructions of A and F. 

Definition 3.1.1. For any f£C[xi, . . . ,x„] of the form^^^^^CaX°- , letitsini- 
tial term polynomial with respect to the weight w be Init„,(/) ■=J2aeA^ CaX°'. 
o 

Definition 3.1.2. Given any Ac Z", a lifting function for A is any func- 
tion Lo : A — > R" and we let A :~ {(a,a;(a)) | a £ A}. Also, letting n : 

M."^^ > R" denote the natural projection which forgets the last coordinate, we 

call A^ := |Conv(7r(i"')) | w€R"\{0}| the subdivision of Conv(A) induced 
by u). Finally, we say that to is a generic lifting iff A^ is a triangulation of 
Conv(A). o 

Definition 3.1.3. Following the notation above, if we have in addition that 
uj{A) C Z", then for any polynomial f{x) ~ '^aeA^a.x"' , it's lift with respect 
to u) is the polynomial f{x,t) := X^aeA '^o^"^'^^"'' ■ Finally, the lift with respect 
to (wi, . . . ,a;„) of a k X n polynomial system F := (/i,...,/^) is simply F := 
(/i, . . . , fk), where fi is the lift of fi with respect to uji for all i. o 



Lemma 3.1.1. Following the notation of Definition 3.1.i, we have that for any 
fixed A, generic lifting functions occur generically. More precisely, there is a finite 
union, TIa, of proper flats in R#"* such that iLj{A) G R'^"^ \ Tl uj is a generic 
lifting for A. M 
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We will now refine the approach of Example 3.0.^ as follows: After building A 
and F via a generic lifting function, we will build a new point set A and a space 
with the following properties: 

(1) Yj is compact 

(2) There is an h-to-1 map from (C*)" to a dense open subset of Y^, for some 
positive integer h. 

(3) F has a well-defined complex zero set Z in Y^. 



(4) There is a natural map tt 



■ c 



wher 



= C U {oo} is the usual 



complex projective line, such that for all to G C*, ^#(7r ^(to) H Z) is 
exactly the number of roots of F in (C*)" with i-coordinate to. 

Our proof of Kushnirenko's Theorem will then focus instead on (a) showing that 
#(7r-i(l) n Z) = #(7r-i(0) n Z) generically, and (b) computing #7r-i(i) at t = 
to avoid the use of limits. We've actuall y alre ady seen an example of (b), from 
an elementary point of view, in Example 3.0.5 of the last section. So let us now 
elaborate the framework needed for (a). 

Definition 3.1.4. Let N:=i^A and let 

Pc :={[2^o : • • • : X]\j] \ xq, . . . , xjv G C not all 0} 

denote complex N -dimensional projective space. Then, given any finite subset Ag 



IP , we let ifA '■ (C*)" — > P,^~^ — the generalized Veronese map — be the 
map defined by x t—^ : • • • : a;""], where {ai, . . . , aj^} = A. We then let Ya — 
the toric variety corresponding to the point set A — denote the closure of 
ipAiiC*r) inV^-\ o 

Being a closed subset of a compact space, we thus see that Ya is compact as 
a topological space and this will be important later for guaranteeing that certain 
limits of curves exist. However, one may wonder if Ya actually compactifies (C*)" 
in any reasonable way and what the closure above really means. Here's one way to 
make this precise. 



Lemma 3.1.2. Following the notation of Definition 3.1.4, let [an, . . . ,ai„]=ai 
for all i. Also let E (resp. E) be the N x n (resp. N x [n + l)) matrix whose i— 
row is {an, . . . , Oin) (resp. (an, . . . , am, 1) )■ Finally, let H be the Hermite normal 
form of E, let UE = H be any Hermite factorization of E, and let Ui (resp. h) 
denote the i— row of U (resp. the product of the diagonal elements of H). Then 

Ya^{[pi:---:Pn]^ P^"' I /^^^ ^P^^'^^^ 
H and, for all i, ul — = Ui and uf G . 
i.e., ^(p2^(p)^h for a?/ pG<^A ((C*)"). I 



where r is the rank of 
Furthermore, (pA is an h-to-1 map. 



The Pi above are sometimes called toric coordinates. The proof of Lemma 3.1.2 is 
a routine application of the Hermite normal form we introduced in the last section. 
Let us see an example of Ya now. 

Example 3.1.1. Taking A as in our last example, we obtain 
ipA{x,y) = [l -.x^-.y: xV : x^y'^] , E^- 



■() 0' 
2 
1 


, and 


■(1 1" 

2 n 1 
(1 1 1 


6 7 




7 r, 1 

6 7 1 
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Using the ihermite command in Maple, we then easily obtain that H - 



is the 



Hermite normal form for E and 



E-- 



is a Hermite factorization 



for E. So Lemma 3.1.i tells us that our Ya here can also he defined as the zero set 
in of the following collection of binomials: 

(Note, for instance, that l^^^{x^ y^Y' ~ {x^Yy^^ — x^^y^^ ~ x^'^y^^ — Q.) Furthermore, 
since h=l ■ 1 ■ our map ipA here is thus a bijection. o 



The most relevant combinatorial/geometric properties of Ya can be summarized 
as follows (see the companion tutorials [Cox03, ^ot03 in this volume for other 
aspects and points of view). 

Definition 3.1.5. Following the notation above, let N and{ai, . . . ,aN}~ 

A. Then, for any face Q of Conv{A), the orbit Oq is the subset 



{[Pi 



Pn]^Ya \ pi^O=^ a,^Q}. 



Also, for any pE Oq with Q a proper face, we say that p lies at toric infinity. 
Finally, given any ./i , . . . , /fc G C[xi , . . . , x„] of the form fi{x) = X^asA '^i,aX°' for all 
i, the zero set of F={fi, . . . , fk) in Ya is simply the set of all [pi : ■ ■ ■ : p^] € Ya 



with YlJ^i Ci,ajPj=0 for all i. o 

Just as a polytope can be expressed as a disjoint union of the relative interiors of its 
faces, Ya can always be expressed as disjoint union of the Oq. The lemma below 
follows routinely from Lemma 3.1.2 and Definition 3.1.4 Recall that a (complex) 
algebraic set is simply a subset of or defined by the zero set of a system 
of polynomial equations. 

Lemma 3.1.3. Following the notation of Definition 3.1.!^ Oq is a dense open 
subset of a d-dimensional algebraic subset ofV^~^, where d = dmiQ. Also, Ya is 
the disjoint union 

U Oq 



Q a face of Conv{A) 



and 



YA\vAiic*r)-^ 



U 



Oq. 



Q a proper face of Conv(A) 



Finally, F has a root in Oconv(A)^ ^if Init^, (i^) has a root in (C*)"^ 



Since all faces of Conv(j4) have a well-defined inner normal. Lemma 3.1.2 thus gives 
a complete characterization of when a root of F lies at toric infinity, as well as which 
piece of toric infinity. This is what will allow us to replace the cumbersome curves 
O{xo,ya,w) mentioned earlier in Example ^.0.5 with a single algebraic curve in Ya. 



3.2. The Smooth Case of Kushnirenko's Theorem. 

Let us now review the last tool we'll need to start our proof of Kushnirenko's 
Theorem: Simplified characterizations of the A-discriminant and Cayley trick, 
and some basic facts on algebraic curves. 
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Definition 3.2.1. Given any k x n polynomial system F, the toric Jacobian 

".xi ■ ■ ■ o 

X2 



matrix of F is ToricJac(_F) = 



Jac(_F). Assuming F is unmixed, 



■ ■ ■ X 

we then say that F has a degenerate root at p=[pi : ••• :pn]gYa iff P is a 

root of F and rank ToricJac(F)|p < k. We then let the discriminant variety, 

^(A A)' denote the set of all 

' - ' (Cl.ai, . . ■ ,Cl,Q„) X • • • X (Cfc,ai, . . . , Cfe,a„ ) € (C^)'' 

such that F has a degenerate root in Ya- Finally, given any k-tuple of point sets 
from M", {Ai, . . . , Ak), its Cayley configuration is the point set 

Cay(Ai , . . . , Afc) (Ai X (0, . . . , 0)) U (yl2 X e„+i) U • • • U (Afc x Cn+k-i ) C o 



Example 3.2.1. Returning to Example 3.0.5 one last time, consider the root 
q=[l : V^: -1:0: 0]eY;4 of 

fi {x, y):=-2t + x^ ~3y + 5x\/ + Ax^yh 

f2{x, y) := 3t + 2x2 ^ ^ ^ ^^1^5 ^ ^x^t, 



where A := Supp(/i) = Supp(/2) 








2 









7 




6 












1 




5 




7 


1 



















1 



Note in particular 



that q e 0(i_2.2) thus lies at the portion of toric infinity corresponding to the 
smallest triangular cell of A^ . The toric Jacobian matrix, in toric coordinates, 
2^2 + 35p4 + 24p5 -3p3 + 25p4 + 28p5 -"^Pi+^Ps 
4p2 + 28p4 + 12p5 P3 + 20p4 + 14p5 3pi + 2p5 



then 



then becomes 
root, o 



-2 



-1 3 



Evaluating at q, our matrix 
which clearly has rank 2, so q is a non-degenerate 



Theorem 3.2.1. Following the notation of Definition 3.2.1, there is a homo- 
geneous polynomial Da S Gfiai , • ■ • , Ca^] such that 

Da{ )^Aa, 
i.e., Aa is always contained in an algebraic hypersurface in . In particular. 



D 



Cay(A, .. .,A}^^ =^ (ci,Qi , . . . , Ci,ajv ) X ' ' ' X (Cfe^ai , ■ • ■ , Cfc^ajv ) ^ ^A'' ■ 



Corollary 3.2.1. Suppose F is an nx n polynomial system with support 
(Ai, . . . , An) and that there is an [n — l)-flat containing translates of Ai, . . . , An. 
Then for fixed (Ai, . . . , A„), F generically has no roots in (C*)". In particular, in 
the unmixed case, F generically has no roots in Ya . B 

Obtaining the discriminant of a system of equations via the discriminant of a single 
larger equation via the Cayley configuration is sometimes called the Cayley trick 



GKZ94|. Theorem 3.2.1 can actually be derived directly from our framework here 



via the toric resultant (see, e.g., [lVIou02| in this volume or [EP02]). However, 
for the sake of brevity we omit the proof. The final additional fact we'll need follows 
easily from the Implicit Function Theorem. 
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Definition 3.2.2. If X CP^^ and Ysubsetec^'^'^ are algebraic curves, then a 
morphism ip : X — > Y is simply a well-defined map of the form [pi : ■ ■ ■ : PN2] ^ 
[01 (pi, . . . iPATa) : • • • : 4'Ni [pi, ■ ■ ■ , ^^2)]; where 0i, . . . , (pN^ are homogeneous poly- 
nomials of the same degree, o 

Lemma 3.2.1. Suppose C C is a smooth algebraic curve (not necessarily 

connected) and ip : C > P^- is any morphism. Then either < 00 or 

■0(X)=P^. In the latter case, there is a positive integer m and a finite set Crit^ C 
Pp, the critical values ofip, such that ^psi~^{t)=m tsP(.\I]. 

Finally, in the special case where C is the zero set in of an n x (n + 
1) polynomial system F{xi, . . . , Xn,t) with Supp(/) C A for all i, A ^ A, and 



T,N2 



ip{ipj^{xi, . . . , x„,t)) = [1 : t] for all t€' 
{F, t — to) has a degenerate root in Y^. 



we have that to (zC lies in Crit. 



Theorem 3.2.2. Following the notation of Theorem 3.0.1 , fix A. Then F 
generically has exactly Yol{A) roots in (C*)", all of which are non- degenerate. 



Proof of Theorem 3.2.2: Let N f^A and {ai, . . . ,a7v} := A as before and 
pick any generic lifting function oj with integral range. Following the notation of 
Definition 3.1.2| , let us then define Ji := A U {(a, w(a) + 1) | a e ^4} and order the 
coordinates p~[pi : • • • : P2n] of P^^ so that 

f^ix t) = [x°'^t^^°'^'^ ■ . . . ■ j-f^N -j-L^idN) . ^ai^uj{ai) + l ..... ^u:{aK) + lj 

Note in particular that F has a well-defined zero set in Y^, as well as Y^: since 
the coordinates pi, . . . ,pn of can be identified with an obvious subset of the 
coordinates of Y^. 

We can now at last define our promised map tt : Y^ — > P^- by p 1-^ [pi : p^v+i]- 
Defining Z (resp. Z) to be the zero set of F in (resp. F in Ya), note that there 
is an isomorphism (an algebraic bijection) between 7r^^(l) H Z and Z defined by 
[pi : ■ ■ ■ : P2n] < — > [pi ■ ■ ■ ■ '■ Pn]- (This is easily checked since Lemma ^.1.2 
tells us that the binomials that define Y^ are exactly those defining Ya union 

{pN+lP2- PlPN+2r ■ ■ ,PN+1PN - PiP2n)-) 

Now note that tt also induces a natural morphism from Z to P^. Let H be 
the Hermite normal form of A and h the product of the diagonal elements of H. 
Since the first n columns of the Hermite normal forms of A and A are the same, 



Lemma 3.1.2 then tells us that the number of roots of F is exactly ft.#(7r ^(1) n Z). 
By applying Theorem 3.2.1 to {A, . . . ,A) (and Proposition 2.0.1) it thus suffices to 
show that /i#(7r-i(l) n Z)=Yol{A) generically 

Next, note that by construction, all the initial term systems of F will be un- 



mixed and have Newton polytopes of volume 0. In particular, by Corollary 3.2.1 



any particular initial term system will generically have no roots. Similarly, by 



Corollary 2.0.2, the initial term systems of F will have each have smooth zero set 
generically. So by Lemma 3.1.3| and Proposition 2.0.1, it will be generically true 
that F will have no roots at toric infinity in Ya, and all the roots of F at to ric 
infinity in Y^ will be non-degenerate. Furthermore, by applying Theorem [3.2.1 
{A, . . . , A), we know that Z is generically smooth. So by Proposition [2.0.1 



to 



again. 



it thus suffices to show that [Z, Z, and Z fl (Toric Infinity in Y^) smooth 
and ZCf^iiC*)")] =J> /i#(7r-i{l) n Z)=Vol(A). 
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So let US now assume the hypothesis of the last implication. By Lemma 3.2.1 
Z (resp. Z n 7r~^(0)) smooth =^ 1 (resp. 0) is not a critical value of tt. Also, by 
the Implicit Function Theorem, the smoothness of Z implies that 7r(Z) contains a 
small open ball about 1. So by the first part of Lemma |3.2.1 , 7r(Z)=Pj.. 

Clearly, remains path-connected even after a finite set of points is removed, 
so let L be any continuous path connecting and 1 in Pj.\Crit(7r|^). By the Implicit 
Function Theorem once more, and the fact that L is compact (by virture of the 
compactness of P^), we must have that j^ij^^^it) n Z) is constant on L. So we 
now need only show that /i#(ir-^(0) n Z)=Vol(A). 

To co nclude, note that A and A have the same lower hull, so Lemmata ^.1.3 
and 3.1.2 then imply that 7r~^(0) n Z is nothing more than 



[Pi 



Ci,ajPj = for all i G {1, . . . , n} for some cell Q of A^ 



In particular, by our smoothness assumption on tt ^(0) n Z, Corollary 2.0.2 tells 
us that 7r~^(0) n Z is actually 



Ci,ajPj = for all ie{l. . . . ,n] for some full-dimensional cell Q of 



a,€C 



Since A^ is a triangulation. Corollary 2.0.2 and Lemma j3.1.2| (along with another 
application Hermite factorization) tells us that 

Vol(g) Vol(A) 



#^-^(o)nz= 



E 



a full-dimcnsional cell of 



SO we are done. 



Remark 3.2.1. Our proof generalizes quite easily to arbitrary algebraically 
closed fields and positive character stic, e.g., the algeb raic c losure of a finite field. 
One need only use a little algebra to extend Lemma 3.2.1 to algebraically closed 
fields and then one can use the same proof above almost verbatim, o 

3.3. Path Following, Compactness, and the Degenerate Case of Kush- 
nirenko. 

Let us now allow degeneracies for the zero set of F and arrive at a strengthening 
of Kushnirenko's Theorem. 



Theorem 3.3.1. Following the notation of Theorem 3.0.1, let Za be the zero set 
of F in Ya, and let {Zi} be the collection of path- connected components of Za- Then 
there is a natural, well-defined positive intersection multiplicity fi : {Zi} — > N 
such that /x(Zi) = Vol(A) and fi{Zi)~l if Zi is a non-degenerate root. 

We actually have all the technical preliminaries we'll need, except for one last 
simple proposition on path-connectedness. 

Proposition 3.3.1. IfTi is any algebraic hypersurface in then B\Tl is 
path- connected for any open ball B in C^. I 



Proof of Theorem |3.3.1 ; Let N ■.= ^A as usual and let P{A) denote the space 
of all polynomials in C[xi, . . . ,Xn] with support contained in A. Clearly, we can 
naturally identify P{A) with C^. Since zero sets of polynomials are unchanged 
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under scaling of the coefBcients, we will then let F{A, ...,A):= (Pc )" be the 

n 

space we'll use to consider our possible F. 

Note now that if i^G(Pc~^)"\^(A,...,A) then we are done by Theorem |3.2.2| 



(simply setting fi{Zi) = l for every root Zi). Indeed, since (P,^ "'^)"\Aj.^ 



path-connected by Proposition |3.3.l| , the Implicit Function Theorem tells us that 
F had better have the same number of roots in Ya as an F which has smooth zero 
set and no roots at toric infinity. 

Essentially the same idea can be used for Fe j^y In particular, for 

such F, let be any sequence such that — > F. Then, letting Z(') be the 
zero set of F^*^ in Ya_, define ( to be any limit point of {Z^'^}. We must then have 
F{() = and thus Za must be non-empty. 

Now let {Ui} be disjoint open sets with Zi C Ui for all i. (Such a set of open 
sets must exist since Ya is compact and the Zi must be of positive distance from 
each other, using the usual distance in P^~^.) Note then that ^aMJi must be 
compact. By the continuity of as a function of its variables and coefficients, 
there must then be a ball B about F in (P^~^)" such that the roots of any G are 
contained in IJj J7i. 

We may now define ^{Zi) as follows: Take any GgB \ A^.^ and define 



fJ.{Zi) to be the number of roots of G in Ui. Since B\A^j^ is path-connected 



by Proposition 3.3.1, the Implicit Function Theorem tells us that the number of 



roots is independent of whatever G&B\ A^^ we picked. 



Remark 3.3.1. Essentially the same argument was used Mike Shub in [ Shu93| 



to prove an extended version of Bezout's Theorem (see also |BCSS98, Pg. 199] 
Note that neither theorem generalizes the other, but the theorems overlap in the 
special case where A is the set of integral points in a scaled standard simplex. On 
the other hand, our next theorem will generalize both Kushnirenko 's Theorem and 
Bezout 's Theorem. The elegance of Shub 's approach is that it gives a rigourous and 
simple approach to intersection theory for a broad class of polynomial systems, o 



Remark 3.3.2. Combining Theorems 3.3.1 and 3.2.i we immediately obtain 



our earlier simpler statement of Kushnirenko 's Theorem (Theorem 3.0.1). o 



4. Multilinearity and Reducing Bernstein's Theorem to Kushnirenko's 

Theorem 

The big question now is how to count the roots of a mixed polynomial system, 
since being unmixed is such a strong restriction. Toward this end, let us consider 
another consequence of the basic properties of discriminant varieties. 

Lemma 4.0.1. Let F and G be any n x n polynomial systems with support con- 
tained in (Ai, . . . , An) component-wise. Then, generically, F and G share no roots 
in (C*)". Furthermore, the number of roots of F is generically a fixed constant. ■ 
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As an immediate consequence, we obtain the following preliminary answer to 
our big question. 

Definition 4.0.1. Let Si, . . . ,Sk be any subsets o/R". Then i/ieir Minkowski 
sum is simply Si + ■ ■ ■ + Sk ■={yi + ■ ■ ■ + Vk \ Ui^Si for all i}. o 

It is easily proved that Newt(/g) = Newt(/) + Newt(g) (once one observes that 
the vertices of Newt(/g) are themselves Minkowski sums of vertices of Newt(/) 
and Newt{g)). So it should come as no surprise that Minkowksi sums will figure 
importantly in our discussion relating polyhedra and polynomials. 

Lemma 4.0.2. Let J\f{Ai, . . . , An) denote the generic number of roots in (C*)" 
of an nxn polynomial system F with support {Ai, . . . , A^)- Then J\f{Ai, . . . , A^) is 
a non-negative symmetric function o/Conv(Ai), . . . , Conv(A„) which is multilinear 
with respect to Minkowski sum. 

Proof: That Af{Ai, . . . , A„) is a well-defined non-negative symmetric function of 



Ai, . . . , A„ is clear (thanks in part to the last part of Lemma 4.0.1 ). The form ula for 
Af{Ai, . . . , A„) in the unmixed case then follows immediately from Theorem 3.2.2. 
Translation invariance follows easily since the roots of F in (C*)" are the same as the 
roots of {x^'fi, x^-fn) in (C*)". Defining cclw.j] := (x"" • • • , . . . , x"'" ■ ■ ■ x"- 
for any nxn matrix [uij], it is then easily checked that Supp(F(x'^)) = {UAi, . . . , UAy-, 
and U unimodular (cf. Definition 2.0.1) => the map x x^ \s, an analytic bi- 



jection of (C*)" into itself. So it is clear that U(UAi, UAn) =N{Ai, . . . , An). 

We thus need only show that A/" is a multilinear function of the convex hulls. To 
see the multilinearity, note that the zero set of (/i/i, /2, . . . , fn) in (C*)" is exactly 
the union of the zero sets of (/i, /2, . . . , fn) {fi, f2, ■ ■ ■ , fn)- So by the first part of 



Lemma 4.0.1, and the symmetry of A/", multlinearity follows. 



Recall now the polarization identity: 



m{xi,...,Xn)= ^ (-1)" f " j m I ^Xi, . . . ,^a;, j , 

valid for any symmetric multilinear function. (The identity is not hard to prove via 
inclusion-exclusion [ GKP94 1 . See also [ Gol03| in this volume for another point of 



view.) Therefore, we must have 
AA(Ai,...,A„) 



0#/C{l,...,ri} ^ i£l 



and thus M{Ai, . . . , An) depends only the convex hulls of Ai, . . . , An. ■ 

So we have answered our big question, assuming we know a function Ai{Pi, . . . , P„), 
defined on n-tuples (Pi, . . . ,Pn) of polytopes in R", that satisfies the obvious ana- 
logues of the properties of Af{Ai, . . . , An) specified in Lemma 4.0.2[ However, such 



a function indeed exists: it is called the mixed volume and we denote it by A4{- 
Abusing notation slightly by setting (Ai , . . . , A„) ■.^M{Conv{Ai), . . . , Conv(A„)), 
we immediately obtain the following result. 

Theorem 4.0.2 (Bernstein's Theorem). Suppose F is any nxn polynomial sys- 
tem with fixed support Ai, . . . , An. Then F generically has exactly M{Ai, . . . , An) 
roots in (C*)". 
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Of course, we now appear to 
volume? This we now answer. 
Remark 4.0.1 




Felix Minding 



have an even bigger question: what is mixed 



David N. Bernstein was the first to prove 



Theorem ^.0.1 in a slightly stronger form, 
along with an algebraic condition for when 
F wo uld have exactly mixed volume many 
roots [Ber75] ■ Interestingly, Felix Mind- 



Heiiriariii 
Minkowski 



ing appears to have been the first to prove 
the special case n = 2 in 1841, and 
mixed volume wasn't even defined until 
near the end of the 19— century by Her- 
mann Minkowski, o 



We also point out that Minkowski was born on 22 June, 1864, in a town named 
Alexotas. This town used to belong to what was the Russian empire at the time 
but is now the Lithuanian city of Kaunas. 



5. Mixed Subdivisions and Mixed Volumes from Scratch 

Let us begin with an illustration of one of the simplest non-trivial examples of 
a Minkowski sum: 




There are many different dcfintions of mixed volume but the two most im- 
portant use Minkowski sums in an essential way. More to the point, if one can 
subdivide Pi + • • • + P„ in a special way, then one is well on the way to computing 
mixed volume. This is where mixed subdivisions enter. 



Definition 5.0.2. |HS95| Given polytopes Pi ,. . 
of (Pi,...,Pfe) is a finite collection of k -tuples {(Cf, 
following axioms: 



,Pk C M", a subdivision 

•■iC'^)}aGS satisfying the 



(1) {JaesCr=P^forall^ 

(2) Cf n cf is a face of both Cf and Cf for all a,(3,i. 

(3) cf a face of Cf for all i^C^,...,C^ have a common inner normal. 

Furthemore, if we have in addition that X^i "^i™ = Si '^'^ then we 

call {(Cf , . . . , C^)}aes o, mixed subdivision, o 
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Example 5.0.1. 




-40 -20 20 40 60 80 100 120 

The mixed area is 9794 



Here we see a very special kind of subdivision {Qi} of the Minkowski sum of 
two polygons Pi and P2 , each with many vertices. In particular, the subdivision of 
Pi + P2 above is built in such a way as to encode a mixed subdivision {(Cf ,C^) 
of (Pi,P2)- In particular, we see that each Pi has a distinguished vertex Vi, and 
that we can read off a mixed subdivision of (Pi,P2) ols follows: there are two cells 
{Pi,V2) and (vi,P2), corresponding to the two cells Pi + V2 and vi + P2 of {Qi}. 
The remaining cells o/{(Cf ,C^) are parallelograms of the form (Ei,E2) where Ei 
is an edge of Pi for all i. o 

It is easily verified that any subdivision of (Pi, . . . , P^) immediately induces a 
subdivision of (APi, . . . , APfc), for any Ai, . . . , > 0. 

Example 5.0.2. 




-40 -20 20 40 60 80 100 120 

The mixed area is 9794 



Note in particular that the areas of the cells of our induced subdivision of APi + 
I1P2 scale according to their type. In particular, it is clear that for our above 
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example, Area(APi + /iP2) = Area(Pi)A^ + 7\/A^ + Arca(P2)/i^, where M is the sum 
of the areas of all the mixed cells (the parallelograms) . o 

Definition 5.0.3. Following the notation above, the type of a cell (Cf , . . . , C^) 
of a subdivision of (Pi, . . . , P„) is simply the vector (dimCf , . . . , dimC^). In par- 
ticular, the cells of type (!,...,!) are called mixed cells, o 

The following two lemmata are then immediate. The first follows from a slight 



modification of the proof of Lemma 3.1.1, while the second follows almost immedi- 
ately from the first. 



Lemma 5.0.3. Following the notation of Definition 3.1.1 , recall that 

TT : M"+i — > R" 

is the natural projection which forgets the last coordinate. Then, given finite point 
sets Ai, . . . , An C Z" and lifting functions LOt for Ai for all i, the collection (Ai, . . . , A„) 
{(7r(Conv(ii)^"), . . . ,7r(Conv(i„)"')) | w€R"\{0}} always forms a subdivision of 
(Conv(Ai), . . . , Conv(A„)) — the subdivision of (Conv(Ai), . . . , Conv(A„)) in- 
duced by (wi, . . . ,uin)- In particular, for fixed {Ai, . . . , An), {Ai, . . . , An)ui will 
generically be a mixed subdivision. M 

Lemma 5.0.4. Por Ai . . . , A„>0, and any polytopes Pi, . . . , PnGM" , the quan- 
tity Q{Xi, . . . , A„) := Vol (X^iLi ^il^i) homogeneous polynomial of degree n with 
nonnegative coefficients. ■ 

We then at last arrive at the following definition of the mixed volume. 

Definition 5.0.4. Given any polytopes Pi, . . . , P„ (l R", their mixed volume is 
the coefficient o/ A1A2 • • • A„ in the above polynomial Q{Xi, . . . , A„). o 

Example 5.0.3 (The Unmixed Case). It is easily checked that M{P, . . . , P) = 
Vol(P). Note also that the multilinearity of Ai{-) with respect to Minkowski sum 
also follows immediately from the preceding definition, o 

Example 5.0.4 (Line Segments). It is also easily checked that Ai{{0, ai}, . . . , {0, a 
I det[ai, . . . , a„]|, where ai, . . . , an are any points in R" and [ai, . . . , a„] is the ma- 
trix whose columns are ai, . . . , a„. o 

The next two c haracteri zations follow easily from the last two lemmata, and 



inclusion-exclusion |GKP94 



Lemma 5.0.5. For any mixed subdivision {(Cf , . . . , C")} of (Pi, . . . , P„), 



M(Pi,...,P„) := E Vol I J. 

(Ci,...,C„) \ I I 

a cell of type 

Furthermore, we have A^(Pi, . . . , P„) := ^ (-l)"-#-^Vol ( E ) ' * 

0#/c{i,...,ji} Vie/ / 

Example 5.0.5 (Cornered Spikes). A less trivial puzzle is the following formula: 
7W({0, aiiei, ai„e„}, . . . , {0,aiiei, ai„e„}) = max I n ai<T(j)|' where aij are any 

non-negative real numbers and a ranges over all permutations of {1, ... ,n}. This 
gives some indication that mixed volume includes many simple functions as a special 
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case. However, our next example shows that mixed volume includes rather non- 
trivial functions as well, o 

Example 5.0.6 (Bricks, a.k.a. the fine muhigraded case). Via multilinearity, it 
easily follows that A^([0, dn] x • • • x [0, din], • ■ • , [0, dni] x • • • x [0, c?„„]) = Perm[(i.ij], 
where Perm denotes the permanent .f] In particular, this immediately shows that 



computing mixed volume is ^P-hard [pap95 , DGH98|. o 



Let us now finaUy prove Theorem 2. 
Proof of Theorem 2: Note that by Bernstein's Theorem, it suffices to find an 
algorithm for computing M{Ai,A2) with bit complexity 0{bN + NlogN). The 
main idea of the proof can then already be visualized in the first mixed subdivision 
we illustrated: one computes the mixed area of [Ai, A2) by first efficiently comput- 
ing the convex hulls of Ai and and then expressing the sum of the areas of the 
mixed cells compactly without building the entire mixed subdivision. This is not 
a contradiction, provided one views the mixed cells in the right way. 

More precisely, first recall that the convex hulls of Ai and A2 can be computed 
within 0{N \ogN) bit operations, via the usual well-known 2-dimensional convex 



hull algorithms [PS85]. In particular, with this much work, we can already assume 
we know the inner edge normals of Pi := Conv(Ai) and P2 '■= Conv(A2), and the 
vertices of Pi and P2 in counter-clockwise order. 

Let us then pick a vertices vi G Pi and V2 G P2 such that their angle cones are 
disjoint. Then there is a mixed subdivison (which we will never calculate explicitly!) 
with exactly 2 non-mixed cells — (Pi, V2) and (ui, P2) — and several other mixed 
cells. (This is easily seen by picking a lifting function loi for Pi that is identically 
zero, and a linear lifting function lj2 for P2 that is minimized at V2 and is constant 
on a line that intersects the angle cones of vi and V2 only at the origin.) 

Note then that the union of the mixed cells can be partition into a union of 
strips. In particular, by construction, there are disjoint contiguous sequences of 
edges {Ef^, E'i}) and {e'('\ s'i'J), with £;f ^ and e'^'^ incident to u,, for 
all i. Furthermore, every mixed cell of {Ai,A2)u> is of the form (ij|^\ sj^-*) or 

(i?j-^ E^^ and every E^' and £'j' ' is incident to some mixed cell. 

The partition into strips then arises as follows: the mixed cells of (^1,^2)^ 
can be partitioned into lists of one of the following two forms: 

(i?j^i?(^)),...,(i?f\i?^)) 

(<\ <)),..., (<\<)) 

where j e {1, . . . , ai} (resp. j € {f , . . . , oi/}), ruj < Uj, and nj < 02 (resp. nj < 02'. In 
particular, the union of the mixed cells in any such list is simply the Minkowski sum 
of a continuous portion of the boundary of P2 and an edge of Pi , and its area can 
thus be expressed as the absolute value of a determinant of differences of vertices 
of the Pi. Furthermore, each formula can easily be found by a binary search on the 
sorted edge normals using O (NlogN) comparisons. 

Since there arc no more than such strips, the total work we do is bounded 
above by the specified complexity bound, so our upper bound is proved. 



nating signs in the full determinant expansion are replaced by +l's. 
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To obtain our lower bound, note that the mixed area of {Ai,A2) is zero iff [[Pi 
or P2 is a point] or [Pi and P2 are paraUel hne segments]. So just knowing whether 
the mixed area is positive or not amounts to a rank computation on a matrix of 
size 0{N) and thus can take no less than ^l{bN) bit operations in the worst case 
|BCS97| . ■ 



6. A Stronger Bernstein Theorem Via Mixed Subdivisions 



Here we prove the following generalization of Theorem 3.3.1. It is at this point 
that we will use a slightly more high-brow type of toric variety: the toric variety 
Xp corresponding to a polytope P. In essence, the key properties that we needed 
from Ya (that it compactify (C*)" and have a partition into orbits corresonding to 
the faces of a polytope) continue to hold for Xp. We make this change to avoid 
technicalities in defining the zero set of F in Yaih when F is mixed. Since 



Xp is discussed elsewhere in this volume at greater length [Cox03, Sot03], we 
proceed with the statement of our theorem. 



Theorem 6.0.3. Following the notation of Theorem 4-0-^ , let F :=Co-av{ A 



• • • + Conv(A„), let Zp be the zero set of F in Xp, and let {Zi} be the collection 
of path- connected components of Zp. Then there is a natural, well-defined posi- 
tive intersection multiplicity /i : {Zi} — > N such that = Vol(^) and 
li{Zi) = l if Zi is a non- degenerate root. 

The proof will be almost exactly the same as that of our extended version of 
Kushnirenko's Theorem, so let us first see an illustration of a toric deformation for 
a mixed system. 

Example 6.0.7. Take n — 2 and 

fi{x,y) :=:ci,o + Ci,(a,o)a;" + Ci,(o,/3)?/'^ + Ci,(„,^)a;"/ 

f2{x, y) :=C2,o + C2,(^,o)a^'' + C2,(o,5)y^ + C2,('i ,S)x''' ■ 

By Bernstein's Theorem, the number of roots should be a5 -f (3^ , so let us try to 
prove this. 

Let us take the following lifting of F: 

fi{x, y, t) :=ci,o + Ci,(a,o)a;"* + Ci,(o,/3)y''i + Ci^(a,p)x"'y'^ 



hix, y) :=C2,ot + C2,(7,0)S'^ + C2,(o,5)y'' + C2,(-l,S)X'V^i 

In particular, we see that there will be exactly one mixed cell for {Ai,A2)i^ and its 
corresponding initial term system will be 

Iiiit(o,o,i)(^)(a;, 2/,i) = (ci,o + ci,(„_^)a;"2/^, 02,(^,0)2;'' + C2,(o,<5)y'') 
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7X (Minkowski Sum) 




The idea of our proof of Bernstein's Theorem then mimics our ear lier proof 

of Kushnirenko's Theorem: our lifting induces a lifted version P = Conv(Ai) + 
Conv(A2) of P and we'll then try to build a map from our lifted zero set to the 
projective line. To do so, we'll define P := P x [0, 1] and this is illustrated below. 





= 



In particular, the only portion of the low^muTl of P (i.e., the "lower portion" 
of toric infinity on Xp ) which is touched by zero set of F in Xp is the parallelogram 
facet, and the projection of this facet has area exactly a5 + (3^. o 



Proof of Theorem B.0.3; We will first prove the generic case, and then derive 
the degenerate case, just as we did for the unmixed case. 



At this point, we could just use Theorem 4.0.2 to get the generic case and 



proceed with our proof of the degenerate case. However, let us observe that we could 
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instead use mixed subdivisions to directly obtain Theorem 4.0.2 without reducing 
to the unmixed case. The proof proceeds exactly like the proof of Theorem 3.0.1 , 
except for the following differences: 

(1) We work with Xp instead of Y^, where P= (A H + Pn) x [0, 1]. 

(2) The map tt is essentially the same but is instead defined via the Cox 



coordinate ring [Cox03| 



(3) The only portions of toric infinity in Xp that intersect tt ^ (0)nZ arc those 
corresponding to facets on the lower hull of P that project to mixed cells 

of (Pl,...,P„)a.. 

(4) The final count of roots becomes a sum of roots of a collection of binomial 
systems. 

To prove the degenerate case, we proceed exactly as in the proof of Theorem 
3.3. 1| , except with the following minor modifications: 

(1) We use the notational changes above. 

(2) The space of F wc work with is instead P^^^^ x • •• x P^"^^, where 
N 1 = 41^ A, for aU i. ■ 

We are now finally ready to state and prove the full version of Theorem 1: 
Theorem 1 (Full Version) Suppose F is any k x n polynomial system with 
Supp(/i) C c (N U {0})" for all i and let Zc{F) denote the zero set of F 
in C". Then the number of connected components of Zc{F) is no more than 

Vol({0,ei,...,e„)uULi^i) or ({O, ei} U Ai, . . . , {O, Cfe} U A^), according 
as k <n or k>n. 

Proof: Let B -.— {O^ei, . . . ,e„) U 1J*L ^ Aj. If k<n then we can simply set fk+i - 



fn = fi and then apply Theorem 3.3.1, noting that the Supp(/i)C_B for all i. 



In particular, it is easily checked that Yg actually contains an embedded copy of 
C". 

To prove the case k > n, note that wc can reconsider such an _F as a fc x fc 
polynomial system with Supp(/i) C {0, a} U Ai for all i. Once again, by virtue of 
the fact that P := Conv(^i) + • • • + Conv(Afc ) contains the non-negative orthant 
as one of the cones in its normal fan [|Cox03| , we have that our underlying toric 



we re 



variety (this time, Xp) has an embedded copy of C". So by Theorem 6.0.3 
done. ■ 
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